ENUMERATING THE NASH EQUILIBRIA OF RANK 1-GAMES 



THORSTEN THEOBALD 

Abstract. A bimatrix game (A, B) is called a game of rank k if the rank of the matrix 
A + B is at most k. We consider the problem of enumerating the Nash equilibria in 
(non-degenerate) games of rank 1. In particular, we show that even for games of rank 1 
not all equilibria can be reached by a Lemke-Howson path and present a parametric 
simplex-type algorithm for enumerating all Nash equilibria of a non-degenerate game of 
rank 1. 



1. Introduction 

Bimatrix games belong to the fundamental concepts of game theory. A bimatrix game 
is given by two payoff matrices (A,B), and by Nash's results any bimatrix game has at 
least one equilibrium [TT1, [19] . The problem of computing a Nash equilibrium (named 
by Papadimitriou in 2001 to be the most concrete open question on the boundary of the 
complexity class V |20j) has received much attention in the last years. Chen and Deng 
recently showed that the problem is WAV-complete [3] and (together with Teng [I]) 
that the problem of computing a l/n^^-approximate Nash equilibrium remains WAV- 
complete. Thus it is unlikely that an efficient algorithm exists. 

The corresponding enumeration problem asks to enumerate all equilibria (in the finite 
case) or all the extreme equilibria (in the degenerate case where an infinite number of Nash 
equilibria is possible). The enumeration problem is similar to (but more difficult than) 
enumerating the vertices of a polyhedron given as the intersection of half-spaces. In the 
latter vertex enumeration problem, the Upper Bound Theorem gives a tight estimate for 
the maximal number of vertices, but the analogous problem of determining the maximum 
number of Nash equilibria of a bimatrix game is an open problem (see [22l [23]). 

For the special case of zero-sum games, the set of Nash equilibria defines a polyhedral 
set in the strategy space, and the problem of computing the set of all Nash equilibria is 
equivalent to linear programming (see [5J Ch. 13.2]). Hence, for a non-degenerate zero- 
sum game, the set of Nash equilibria consists of a single point, and thus the enumeration 
problem becomes trivial. 

Recently, Kannan and Theobald [9] have introduced a hierarchy of bimatrix games in 
which the matrix A + B is restricted to be of rank at most k, for some fixed constant k. 
For any fixed k > 1, this class strictly generalizes the class of zero sum-games. Of course, 
the case k = 1 is of particular importance; it is the smallest extension of zero-sum games 
in the hierarchy. 

In contrast to zero-sum games, non-degenerate rank /c-games (for any fixed k > 1) 
can have an arbitrarily large number of equilibria. In particular, this provides a sharp 
separation between the class of rank 1-games and the class of zero-sum games. From 
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the computational viewpoint, Nash equilibria can be efficiently approximated in games 
of fixed rank, but the question of exact polynomial time computability is open even for 
games of rank 1. 

In this paper, we consider the enumeration problem for games of rank 1. Similar to the 
situation above, the rank condition provides additional structure which can be exploited. 
An initial question is whether any equilibrium can be reached by a Lemke-Howson path 
(as defined formally in Section [3]). For arbitrary bimatrix games Aggarwal has shown that 
in general not all equilibria can be reached by a Lemke-Howson path [Tj. By providing 
an example of a rank 1-game for which not all equilibria can be reached in this way, we 
strengthen Aggarwal's result and thus answer our initial question in the negative. 

As main contribution of the paper, we propose a parametric simplex-type algorithm for 
enumerating the Nash equilibria of rank 1-games. This algorithm is based on the tech- 
niques of Konno and Kuno who have investigated linear multiplicative programs (|10j. 
see also [llj). Our problem can be seen as an enumeration problem of generalized lin- 
ear multiplicative programming. Moreover, the situation of games provides additional 
combinatorial structure which can be exploited. 

The paper is structured as follows. In Section [2J we introduce the basic concepts of 
bimatrix games as well as of rank /c-games and review existing work on the enumeration 
of Nash equilibria. In Section [3X we show that not all equilibria can be reached by a 
Lemke-Howson path in a rank 1-game. Then, in Section HI we present the parametric 
simplex-type algorithm for enumerating the Nash equilibria of a non-degenerate rank 1- 
game; finally, we explain how to modify the algorithm so as to cover degenerate situations 
as well. 



2. Preliminaries 

2.1. Bimatrix games. We consider an m x n-bimatrix game with payoff matrices A,B G 
R mxn . Let 

m n 

Si = {x G W m : x i = 1 , x > °} and S 2 = {y eR n : ^2yj = l, y> 0} 

i=l j=l 

be the sets of mixed strategies of the two players, and let Si = {x G W m : Ym=i x « = 1} 
and S% = {y G M. n : YTj=i Vj = 1} denote the underlying affine subspaces. The first 
player (the row player) plays x G Si and the second player (the column player) plays 
y G S 2 - The payoff's for player 1 and player 2 are x T Ay and x T By, respectively. 

Let C^' denote the i-th row of a matrix C (as a row vector), and let Cu\ denote the j-th 
column of C (as a column vector). A pair of mixed strategies (x,y) is a Nash equilibrium 
if 

(2.1) x T Ay > x 1 Ay and x 7 By > x T By 

for all mixed strategies x, y. Equivalently, (x, y) is a Nash equilibrium if and only if 

(2.2) x T Ay = max A^y and x 7 By = max x T B(~\ . 

l<i<m ^ ^ l<j<n UJ 
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A bimatrix game is called non-degenerate if the number of the pure best responses of 
player 1 to a mixed strategy y of player 2 never exceeds the cardinality of the support 
supp y := {j : yj ^ 0} and if the same holds true for the best pure responses of player 2 
(see [23J ) . In the case of a non-degenerate game the set of Nash equilibria consists of 
finitely many isolated points. We remark that there exist various other definitions of 
degeneracy in the literature which are equivalent to that notion (see [23, Theorem 2.10]). 

2.2. Earlier work on enumeration of equilibria. The classical Lemke-Howson algo- 
rithm serves to find one Nash equilibrium in a bimatrix game (|12j. see also [23] )• We 
discuss this algorithm and its (negative) relation to enumeration in more detail in Sec- 
tion [3j 

From the viewpoint of computational complexity, the problem of counting the number 
of Nash equilibria in a bimatrix game is ^P-hard [5]. Hence, the enumeration problem is 
#P-hard. 

The general idea of existing approaches for the enumeration of Nash equilibria is to 
transform the problem into a problem of polyhedral computation. For each game we 
define a pair of polyhedra in which each Nash equilibrium of the game corresponds to a 
complementary vertex pair. 

Definition 2.1. For an m x n-bimatrix game (A, B), define the polyhedra P and Q by 

(2.3) P = {(x,7r 2 )6E m xl : x^O , f B < l T 7i2 , l T x = 1} , 

inequalities l,...,m inequalities m+l,...,m+n 

(2.4) Q = {(y,m) ER n xR : Ay < Itti , |/_g_0 ,l T y = l}, 

inequalities l,...,m inequalities m+l,...,m+n 

where 1 is the all- 1- vector. 

A pair of mixed strategies (x, y) G Si x S 2 is a Nash equilibrium if and only if there 
exist TV 1, 7r 2 G R such that (x, n 2 ) G P, (y, tx\) G Q and for all i G {1, . . . , m + n}, the 
i-th inequality of P or Q is binding. Here, 7Ti and 1T2 represent the payoffs of player 1 
and player 2, respectively. For i G {l,...,m} we call the inequality X{ > the i-th 
nonnegativity inequality of P, and for j G {1, ... ,71} we call the inequality x T B^ < 7r 2 
the j-th best response inequality of P. And analogously for Q. 

Example 2.2. (Taken from [21]). The bimatrix game with payoff matrices 

A = (3 4) ' B = (2 1 e>) 

has 3 Nash equilibria: 




The polyhedra P and Q are shown in Figured! 

The situation becomes more involved when one also wants to handle degenerate games. 
For an arbitrary bimatrix game, the set of all Nash equilibria is in general a non-convex 
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7T 2 



1- 




FlGURE 1. In the example, P is a two-dimensional polyhedron embedded in 
Si, and Q is a three-dimensional polyhedron embedded in S%. The vertical 
direction corresponds to the variables 7T2 and tti, respectively. 



subset, but can be represented as the union of a finite number of polytopes (called maximal 
Nash subsets) in the strategy space (see [T5]). Hence, in the degenerate situation, the 
task is to enumerate all vertices of every maximal Nash subset of a bimatrix game. Every 
equilibrium point is a convex combination of some of these extreme equilibria (cf. [8| [26] ) . 

These concepts can be used to provide algorithms for enumerating Nash equilibria of 
bimatrix games. The earliest ones can be found in Vorob'ev [25] and Mangasarian [15] ; for 
later developments see Mukhamediev [18], Winkels [26], and Audet, Hansen, Jaumard, 



and Sauvard [2J. The latter paper also contains a detailed historical account on these 
algorithms. 

2.3. Games of fixed rank. For a given constant k > 0, a bimatrix game is called a 
game of rank k if the matrix A + B has rank at most k. 

Kannan and Theobald have shown that for any fixed k > 1, the number of Nash 
equilibria in a non-degenerate game of rank k cannot be bounded by a function in terms 
of k [9]. In particular, the case k = 1 stands in sharp contrast to the case k — of 
zero-sum games; there, the number of Nash equilibria in the finite situation is always 1. 
For the case al := m = n, the best known lower bound for the maximal number of Nash 
equilibria of rank 1-games is linear in d: 

Proposition 2.3. For any d G N there exists a non- degenerate d x d-game of rank 1 with 
at least 2d — 1 Nash equilibria. 

A construction achieving this number is given by the d x <i-game (A, B) with 

(2.5) ay = 2ij-i 2 +f, bij = 2ij + i 2 -] 2 

(see [H]). Since A + B = (4ij) it j, the matrix A + B is of rank 1. 

It is not known whether in games of fixed rank a Nash equilibrium can be computed 
in polynomial time. In [9J, the following approximation result was shown. Here, a pair 
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(x,y) of mixed strategies is called an e-approximate equilibrium if 

i(x,y) < e\A + B\, 
where £(x, y) denotes the sum of the losses of the players, 

£(x,y) = max A^y + max x T Br j\ — x T (A + B)y , 

i 3 

and | • | denotes the maximum absolute value of the entries of a matrix. 

Proposition 2.4. Let k be a fixed constant and e > 0. In a game of rank k, an e- 
approximate Nash equilibrium can be found in time poly(£, l/s), where £ is the bit length 
of the input. 

There are several operations on a bimatrix game (A, B) which do not change the set of 
Nash equilibria: 

(1) adding multiples of the all- 1- vector to a given column of A or a given row of B ; 

(2) positive scaling of a given column of A or a given row of B . 

Since these operations can change the rank of the game, the following useful consequence 
is obtained. 

Lemma 2.5. Let (A, B) be a d x d-game of rank d. Then there exists a game of rank 
d — 1 with the same set of Nash equilibria. 

Proof. Assume without loss of generality that d > 2 and that C := A + B is of rank 
exactly d. Then there exists a column j of C which is not a multiple of the all- 1- vector. 
Denote by v the column vector obtained from the entries of the j-th column of C. Since 
the affine line v + . . . , 1) T intersects the (d — 1) -dimensional linear subspace defined 
by the d — 1 other columns, there exists some AeM such that adding A(l, . . . , 1) T to the 
j-th column of C yields a matrix of rank at most d—1. Thus, adding A(l, . . . , 1) T to the 
j-th column of A turns the game (A, B) into a game of rank d—1. □ 

3. Lemke-Howson paths 

In this section, we recall the classical Lemke-Howson algorithm for finding a Nash 
equilibrium in a bimatrix game and then show that not even for games of rank 1 all 
equilibria can be reached by a Lemke-Howson path. 

The Lemke-Howson algorithm is a simplex-type algorithm which can be combinatorially 
described in terms of a graph. In order to define this graph, we start from the polyhedral 
description of the bimatrix game in terms of P and Q. Consider a pair of strategies 
(x, y) G «Si x S21 and let 7i"i and 7r 2 be the resulting payoffs. We label each of the strategies 
x and y by the indices of the inequalities in (12.31) and (12 .4p that are binding. For a 
non-degenerate m x n-game, only the vertices of P have m labels and the vertices of Q 
have n labels, and there do not exist points in P and Q with more than m or n labels, 
respectively (see [23], Theorem 2.7]). 

We define the graphs G\ = (V\,Ei) and G 2 = (V2,E 2 ) as follows. The vertex set V\ 
of Gi consists of the vertices of P, with an additional vertex G R m that has all labels 
in the set {!,... , to}. Two vertices x and x' are connected by an edge if they differ in 
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exactly one label, i.e., if they have m — 1 labels in common. Similarly, let G 2 be the graph 
whose vertex set V 2 consists of the vertices of Q, with an additional vertex G M. n having 
all labels in {m + 1, . . . , m + n}. Two vertices in G 2 are connected if they have n — 1 
labels in common. 

The product graph G\ x G 2 of G\ and G 2 is defined by the vertex set V\ x V 2 , and 
the edges are given by {x} x {y, y'} for vertices x of G\ and edges {y, y'} of G 2 , or by 
{x,x'} x {y} for edges {x,x'} of G\ and vertices y of G 2 . 

From a combinatorial viewpoint, the Lemke-Howson algorithm can now be described 
as follows. Fix an r e {1, . . . ,m + n}. A vertex (x,y) of Gi x G 2 is called r-almost 
completely labeled if the union of the labels is the set {1, . . . , m + n} \ {r}. Since two 
adjacent vertices x and x' in G\ have m — 1 common labels, the edge {x, x'} x {?/} of 
G*i x G 2 is also r-almost completely labeled if y has the remaining n labels except r. And 
similarly for edges {x} x {y, y'} of G\ x G 2 . The Lemke-Howson algorithm starts from the 
artificial equilibrium (0, 0) which has all labels and then follows the unique path where the 
label r is missing. After finitely many steps, it reaches a Nash equilibrium of the game. 

For different choices of r it is possible that we reach different Nash equilibria. This led 
to the early question in the algorithmic study of games on whether any equilibrium can be 
reached by a Lemke-Howson path, i.e., by some choice of r. For general bimatrix games 
it is known that the set of Lemke-Howson paths does not enumerate all Nash equilibria 
(Aggarwal [Ij; see also [23] and the references therein). 

Since games of rank 1 are a very special case of general bimatrix games, the question 
arises whether for a game of rank 1 all equilibria can be reached by a Lemke-Howson path. 
Here, we strengthen the (un-)reachability result by showing that even for games of rank 1 
not all equilibria can be reached. Namely, we consider the rank 1-game 



which is a variation of Aggarwal's example resulting from Lemma 12.51 The polyhedra P 
and Q are shown in Figure [2j 
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Figure 2. The polyhedra P and Q and the labels of the vertices. 
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The game has three Nash equilibria: 



((1, 0) T , (0, 1) T ) , with payoffs (-18, 30) ; 
((0, 1) T , (1, 0) T ) , with payoffs (-8, 20) ; 
and ((§, |f, (|, ff) , with payoffs (-20, 18) . 



With regard to the Lemke-Howson graphs, the locally maximal peaks of the lower hull 
of P are connected with the 0-vertex (which has labels 1, 2) and the maximal peaks in the 
lower hull of Q are connected with the 0-vertex (which has labels 3, 4). Considering all the 
four possible values of r, only the first two equilibria can be obtained via Lemke-Howson 
paths, and the third equilibrium cannot be obtained. E.g., for the initial missing label 
r = 1 we obtain the following path with labels. Here, the two components correspond to 
the graphs G\ and G<i- 



where the last vertex pair gives the Nash equilibrium ((1, Of, (0, if). Hence we can 
conclude: 

Theorem 3.1. There exist games of rank 1 for which not all equilibria can be reached by 
a Lemke-Howson path. 

Let G' be the graph with vertex set V\ x whose edge set is the union (over all r) 
of r-almost completely labeled edges. In a paper of Shapley [21J, the following example 
attributed to Wilson is given which shows that for games of arbitrary rank the graph G' 
can even be disconnected. Let the 3 x 3-game (A, B) be defined by 
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Figure 3. The projections of the polyhedra P and Q in Wilson's exam- 
ples. The picture contains the labels of the points with three labels. The 
underlined numbers show the best responses of the regions. 
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In the graph G', the mixed equilibrium of (A, B) cannot be reached from the artificial 
equilibrium. Hence, even modifications of the Lemke-Howson-algorithm which are allowed 
to change the index of the missing label within the graph traversal cannot enumerate all 
Nash equilibria. 

Applying the rank reduction Lemma [2751 on Wilson's example immediately implies that 
for 3 x 3-games of rank 2, the graph G' can be disconnected. It would be interesting to 
know if this graph can also be disconnected for the case of rank 1-games. While from the 
principle viewpoint this question is decidable, currently we do not know the answer. 

4. A PARAMETRIC ALGORITHM FOR ENUMERATING ALL NASH EQUILIBRIA OF 

RANK 1-GAMES 

In this section, we present a parametric simplex-type enumeration algorithm for games 
of rank 1. For simplicity we concentrate on the situation where the game is non-degenerate 
and the polytopes P and Q are in general position. In Section 14.21 we explain how to 
modify the algorithm to cover degenerate situations as well. An example is presented in 
Section 14.31 

4.1. Non-degenerate situations. We consider the following well-known characteriza- 
tion of a Nash equilibrium in terms of a quadratic program [13]. For any (x, irz) G P and 
(y, 7Ti) G Q, we have 

x T (A + B)y < max A®x + max z T Bf,i < tti + tio 

l<i<m l<j<n ■ ' 

with equality if and only (x,y) is a Nash equilibrium with payoffs 7i"i and tt2. Consider 
the quadratic program 

(QP :) max x T (A + B)y - iz x - n 2 
(4.1) (x,tt 2 ) G P, 

(y,7Ti) G Q. 

Hence, we obtain the following characterization of the Nash equilibria in terms of the 
quadratic program. 

Lemma 4.1. 1) For any feasible solution of QP, the objective value is nonpositive. 
2) A pair (x*, y*) G Si x 5 2 is a Nash equilibrium of the bimatrix game (A, B) if and only 
if there exist 7rJ , 7r| ^ ^ such that {x*, y*, 7r*, 7:%) is a feasible solution of QP with objective 
value 0. 

Note that the inequalities describing the feasible set are linear. Moreover, the objective 
function only depends on the sum A + B rather than on A or B individually. 

If the game is of rank 1, we write A + B in the form A + B = b ■ c T with b G M m , 
c G MP. That is, after a linear variable transformation we want to enumerate the global 
optima of a function of the form / + g where / is the product of two variables and g is a 
linear function. The problem of optimizing a product of two variables over a polyhedron is 
called a linear multiplicative program Q10J, see also [H]). In these references, parametric 
primal-dual simplex-type algorithms were presented to find the optimal value. Based on 
these techniques, we now present an algorithm which enumerates all equilibria of a rank 
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1-game. Since we are starting from a game, we will see below that the relevant bases have 
a special combinatorial structure, where to elements of the basis refer to the polyhedron P 
and n — 1 elements refer to the polyhedron Q. 

By setting £ := c T y, we can write the quadratic program as 

max (x T b)^ — 7r% — tt 2 
(A9 \ O,^) G P, 

[ ] (y,7n) e Q, 

c T y = £■ 

We consider £ as a parameter to the optimization problem. For a given value to £, the 
problem becomes a linear program which we call LP(£). Geometrically, for fixed £ we are 
slicing the feasible set polyhedron by a hyperplane c T y = £ on which the strategy of the 
second player satisfies a particular linear condition. 

Remark 4.2. In the special case where c is a multiple of the all-l-vector, the hyperplane 
defined by c T y = £ is parallel to the hyperplane defined by l T y = 1 (which is part of 
the definition of Q). In that situation, all columns of A + B are identical, and games of 
this type are called row- constant games (see [7]). We will come back to that special case 
below. 

The feasible set of (14.11) is unbounded. However, since y 6 S2 the range [£ m in, £max] of 
£ is bounded, namely 

(4.3) [£min,£max] = [min c T y , max c T y] = [ min c,, max c 3 ] . 

U&S2 y£<52 l<j<™ i<j<n 

Even for fixed £, the feasible region of LP(£) can be unbounded. But by Lemma |4.1[ 
the objective value of QP is bounded from above and hence also LP(£) is bounded from 
above. 

Let X := {1, . . . , to, m + 1, . . . , m + n} be the index set of the inequalities of P, and 
J := {1, . . . , m, m + 1, . . . , m + n} be the index set of the inequalities of Q. 

We consider a fixed parameter value £. If the resulting LP(£) is in general position, 
then the optimal point v of LP(£) is unique and v is a vertex of the feasible set polyhedron 
of LP(£). If £ is a sufficiently generic value (we will come back to this aspect below) then 
v can be uniquely described in terms of a basis, i.e., by a choice I C X of cardinality m 
and J C J of cardinality n — 1. 

For a given £, let (/, J) denote an optimal basis for the linear program LP(£) depending 
on £. The idea of the enumeration algorithm is to keep track on the values of £ where the 
optimal basis changes. This yields the enumeration algorithm in Figure HI 

In order to explain how the update steps can be performed efficiently, we analyze the 
set of the values of the parameter £ which yield the same optimal basis. For this, we 
consider the dual program of the parametric LP (14. 2p . 

Let Ui be the dual variable corresponding to the z-th inequality of P, 1 < i < m + n, 
and u m+n+ j be the dual variable corresponding to the j-th inequality of Q, 1 < j < m + n. 
Further let M2( m +n)+i and U2( m + n )+2 be the dual variables of the equations l T x = 1 and 
l T y = 1, and U2( m +n)+3 be the dual variable of the equation c T y = £. The dual variables 
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While £ < U do 

Compute the optimal point z := z(£) of LP(£); 
If z(^) has objective value then 

Let x,y be the strategies played in ,z(£); 

Output "Nash equilibrium:", 
Compute the smallest £' > £ where the optimal basis changes and the new optimal basis; 



Figure 4. Enumeration algorithm 



Mi, ... , U2( m + n ) are sign- restricted while the dual variables «2(m+n)+ij ^2(m+n)+3 are 
not. 

We set z = (x, y, 7Ti, 7r 2 ) and the constants If = 2(m + n) and A 7 " = m + n + 2. Let 
M 1 eR KxN and d G R K be defined by 
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Further, let M 2 G 



5x7V 



Mo 



and e 2 G R 3 be defined by 
c T 



e 2 




Then the feasible region of the linear program LP(£) is given by M%z < ei, M 2 z = e 2 . 
For fixed £, the dual of LP(£) is 



min (ef , e^M 



,0.,-l,-l) J , 



n times 



Ml, ... , Mjf > 



where m = (mi, . . . , mk +3 ) is the vector of dual variables. 

By the complementary slackness conditions for linear programming, the optimal solu- 
tion of the dual program satisfies 



(4.4) 



U; 



for alH g" / and 



u 



m+n+j 



for all j (£J. 



A basis (J, J) of LP(£) is also an optimal basis for all those programs LP(£') for which 
the point described by (J, J) is feasible and for which there exists a feasible solution to 
the dual program satisfying the complementarity condition (14.4p . 

For the special case of a zero-sum game, the parametric formulation degenerates to the 
well-known pair of dual linear programs associated with the game. Namely, if b = c = 
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then £ min = £ max = and the matrix M\ becomes 
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For £ = 0, the dual then becomes 
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U!,...,Uk > 0. 



Hence we can conclude: 

Lemma 4.3. If the game is a zero-sum game (i.e., A + B = 0) then the parametric 
problem (14.21) is only feasible for £ = 0. In this case, the constraints of the dual LP 
coincide with the constraints of the primal program under the identifications X{ = u m+n+i , 
1 < i < m, yj = u m +j, 1 < j < n, 7Ti = u 2 ( m +n)+2, 7T2 = «2(m+n)+i, and considering 
ui,...,u m and «2 m +n+i,---,% as slack variables; the objective functions are additive 
inverses under these identifications. 

Similarly, for row-constant games the range of ^-values consists of a single point, and 
the equilibria of these games can be phrased as linear programs. Namely, if there exist 
constants «i,...,« ro el with 

aij + bij = Ui for 1 < % < m, 1 < j < n 

then the zero-sum game (A', B') defined by b'^ = b^ — m has the same Nash equilibria 
as (A,B). 

From now on, let the game (A, B) neither be a zero-sum game nor a row-constant game. 
Let (J, J) be an optimal basis of LP(£) for some parameter value £. Let B = I U J, and 
for a matrix A let A& be the submatrix of A with rows in B. By our assumption, the 
system of linear equations 

(4.5) (M 1 ) B z = ( ei ) fl , 

(4.6) M 2 z = e 2 

has a unique solution. Let z(£) be the solution point of this system. In order to check 
whether there exists a dual solution satisfying the complementary slackness conditions, 
set B' = B U {K + 1, K + 2, K + 3}. Compute 

((M^IMJ)- 1 ^, . . . , bU, 0, • • • , 0, -1, -If 

and set all components of u indexed by {1, ... , 2K}\B to zero to obtain a vector u = u(£) 
with (Mj r |M 2 r ) , u(£) = (&i£, . . . , 6 m £, 0, . . . , 0, — 1, — 1) T and which satisfies the complemen- 
tarity conditions. 
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Lemma 4.4. Let z(£) be defined by solving (14.51) and (14.61) for z and u(£) be as described 
before. The set o/£ such that B is an optimal basis o/LP(£) is given by the two conditions 



Proof. The first condition is satisfied if and only if z(£) is feasible. For a vertex z(£) of 
the feasible set polyhedron the second condition is satisfied if and only if z(£) is optimal 



For a variable £, both conditions in (14. 7p and (14. 8 p are linear conditions in £. Hence, the 
range of ^-values in which both conditions are satisfied defines an interval. Let [0:1,0:2] 
and [fli, fa] be the intervals defined by (14. 7B and ( 14. 8L respectively. Then [£1,^2] : = 
[max{ox, /3i}, min{o2, fa}] is the interval for £ in which both conditions are satisfied. 
Since (/, J) is an optimal basis for some £, the interval is nonempty. We distinguish two 
cases: 

Case £2 = 02. Then for the value £ = £2, there are m + n inequalities, indexed by 
B U {j} for some j, which are binding in the primal program. One of the inequalities i of 
the current vector z(£) becomes violated for £ > £2- We remove the index i from B and 
replace it by the index j. Since i was chosen to be the earliest violated inequality, after 
this dual simplex step the new basis defines a feasible and optimal point for sufficiently 
small £ > £ 2 - 

From the viewpoint of the game, we can distinguish the following subcases correspond- 
ing to the set of m + n active inequalities. 

(1) m + 1 inequalities for x and n — 1 inequalities for y are binding. 

Then i and j refer to indices of inequalities for the polyhedron P. If i G 
{l,...,m} then one of the unplayed pure strategies of the first player is now 
effectively played. If i G {m + 1, . . . , m + n} then one of the previous best pure 
responses of the second player becomes a suboptimal response. 

If j G {l,...,m} then one of the played pure strategies of the first player 
becomes unplayed. If j G {m + 1, . . . , m + n} then one of the previous suboptimal 
pure responses of the second player becomes a best response. 

(2) m inequalities for x and n inequalities for y are binding. 

Then % and j refer to indices of inequalities for the polyhedron Q. If % G 
{1, . . . , m} then one of the previous best pure responses of the first player be- 
comes a suboptimal response. If i G {m + 1, . . . , m + n} then one of the unplayed 
pure strategies of the second player is now effectively played. 

If j G {1, . . . , m} then one of the previous suboptimal pure responses of the first 
player becomes a best response. If j G {m + 1, . . . , m + n} then one of the played 
pure strategies of the second player becomes unplayed. 

Case £ 2 — /^2- Then for the value £ = £2 there exists an index i G B such that the dual 
variable yi becomes zero. For £ > £2 the current vector z(£) is no longer optimal. We 
remove the index i from the basis and perform a simplex step moving along an edge of 



(4.7) 



M lZ (0 < ex, 




for LP(0- 



□ 
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the polyhedron. Since i was chosen to be the earliest violated optimality condition, this 
simplex step gives a new optimal basis B' . 
We conclude: 

Theorem 4.5. Let (A,B) be a non-degenerate bimatrix game of rank 1. Then the para- 
metric algorithm enumerates all Nash equilibria. The running time of the algorithm is 
polynomial in the product fo(P) ■ fo(Q), where fo(P) and fo(Q) denote the number of 
vertices of P and Q respectively. 

Proof. For those parameter values £ where the basis does not change, the optimal basis 
is given by an m-element subset I C X corresponding to a vertex of P and by an {n — 1)- 
element subset J C J that can be extended to an n-element subset characterizing a vertex 
of Q. □ 

As mentioned in Section [2] note that in general not every vertex pair of P and Q 
corresponds to a Nash equilibrium. 

4.2. Degenerate games. If the game is degenerate then there are two issues. The 
polyhedra might not be simple and the number of Nash equilibria can become infinite. In 
order to resolve the first of these points, we have to cope with the same issues as in the 
case of the simplex algorithm (in particular, the issue of possible cycling when changing 
a basis). However, with the same techniques as for linear programming (such as symbolic 
perturbation, lexicographic ordering), these situations can be resolved. 

In order to extend the algorithm to degenerate games with an infinite number of equi- 
libria as well, by Section [2T21 the extreme equilibria are sufficient to determine all maximal 
Nash subsets. Indeed, our method can be modified to find all extreme equilibria even in 
degenerate cases. 

4.3. Example. We consider the rank 1-game from (I2.5P for d = 2: 

i.e., A+B = bc T with b = (2, 4) T , c = (2, 4) T . We have mm yG s 2 c T y = 2 and max ye ,s 2 c T y = 
4. For the value of £ = 2, the inequalities with indices in {2, 3, 5, 8} are binding. 





objective 


binding 


2 





{2,3,5,8} 


*e(2,|) 


< 


{2,3,5} 


5 


i 

4 


{2,3,4,5} 


1 e (f,3) 


< 


{3,4,5} 


3 





{3,4,5,6} 


£e(3,I) 


< 


{3,4,6} 


7 


1 

4 


{1,3,4,6} 


|e(!,4) 


< 


{1,4,6} 


4 





{1,4,6,7} 
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For £ = |, in the optimal situation we obtain uniquely y = (§,7) and iz\ — -4. The 
optimal values for x and 7r 2 are not unique; by substituting x 2 = 1 — x\ we can analyze 
the situation locally around the parameter value £ = | in the (x%, 7r 2 )-plane (see Figure [5] 
but note that the x and y-axis are scaled differently). The induced optimization problem 
is 

max (-2xi + 4)£ - tt 2 - f 

< x 1 < 1, (1,2) 
-5 Xl + 7 < tt 2 , (3) 
-7x 1 + 8 < vr 2 . (4) 

For £ = § the objective function is —5a;i _ ^2 + x> so that both points pi := (1,2) T 
and p 2 := (|, |) T (as well as all convex combinations) are optimal. The first one comes 
from the basis {2, 3, 5} and the second one from the basis {3, 4, 5}. For some sufficiently 
small e, in the case of £ = | — e, the first of these bases is optimal, and in the case of 
£ = I + e the second of these bases is optimal. 




Figure 5. The situation for £ = |. Since the X\- and 7r 2 -axes are scaled 
differently, the outer normal vectors to the lines do not look orthogonal in 
the figure. 



5. Conclusion and outlook 

We have presented an enumeration algorithm for the Nash equilibria of non-degenerate 
rank 1-games which is adapted to the rank 1-structure. Our running time estimate was 
based on bounding the number of vertices of the polyhedra involved. An open question 
is in how far one can improve the running time analysis of the algorithm. 

A widely open question is to develop enumeration algorithms for games of rank k (for 
fixed k EN) exploiting the low-rank structure. 

Acknowledgments. Thanks to the reviewers for very helpful comments and corrections. 
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